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(54) Message broker providing a publish/subscribe service and method of processing messages 
in a publish/subscribe environment 

(57) Provided is a message broker and a method of 
processing message content within a publish/subscribe 
message distribution service. The message broker and 
method extends publish/subscribe service capability to 
heterogeneous communications networks in which dif- 
ferent subscriber systems can have very diffierent capa- 
bilities in terms of their multimedia presentation 
capabilities, processing capabilities and storage capa- 
bilities. The broker modifies the content of messages 
received from publisher application programs to con- 
form to the subscriber system capabilities prior to distri- 
bution to subscriber applications. The broker is adapted 
to identify characteristics of the content of received 
messages and to use this information to determine 
which of a set of message handler modules to pass the 
message to. Derived messages are generated by the 
selected message handler module and sent to sub- 
scribers for which this message type or message con- 
tent has been requested. A multimedia processing 
module is preferably selected in response to identifica- 
tion of certain multimedia content. 
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Description 

FieM Qf Invention 

[0001] The present invention relates to a message 5 
broker providing a publish/subscribe service, and to a 
metliod of processing message content in a pub- 
lish/subscribe environment. 

Background io 

[0002] Publish/subscribe is a known information 
distribution medianism in wliich Information providers 
(Pubiisliers) send infonnation electronically across a 
network to a community of infonnation users (Subscrib- 15 
ers) who have chosen the topics of information they 
wish to receive. Typically, publishers publish a single 
message to a broker. The broker then decides from its 
registered subscriptions where to send copies or deriv- 
atives of the message. 20 
[0003] Known publish/subscribe mechanisms rely 
on distribution lists, typically executing very simple logic 
to achieve required routing of highly structured, small 
transactional messages. Subscriber applications can be 
registered to receive all messages which include a spe- 25 
cific subject classification or keyword, such as a com- 
pany name or a more specific subject, within the 
message header or within a particular content field of 
the structured message. Message dictionaries hold 
information about the structure of messages, providing 30 
the message broker with a definition of the structure of 
all messages that will pass through it and so enabling 
unpacking of fields from the message content for 
inspection. The message broker queries the message 
header or a field within the content to retrieve the sub- 3S 
ject classification or keyword, and this is then compared 
with a list of subscriber applications' information 
requirements and any other stored rules for routing 
messages to determine which applications the mes- 
sage should be sent to. 40 
[0004] Knowledge of the registered applications' 
required message formats also enables message for- 
mat transformations (for example, an incoming mes- 
sage's text content may be in COBOL whereas a 
subscriber may require this content to be converted to 45 
text within an e-mail memo). An example apparatus and 
method providing this capability is described in US pat- 
ent 5,187.787. 

[0005] These known publish/subscribe mecha- 
nisms can only operate with highly structured mes- 50 
sages, since the simple logic involves comparison of the 
contents of only a specific message field with entries In 
a distribution list. Furthermore, known publish/sub- 
scribe mechanisms have typically only used simple 
logic for routing determination and format transforma- 55 
tions and have not extended the publish/subscribe envi- 
ronment to more complex processing of message 
content which is not text-based. Since modem commu- 



nications increasingly Include non-text based mes- 
sages, such as the sending of graphics and video over 
the Internet, these limitations of the prior art systems 
are a significant problem. 

[0006] Furthermore, the typical network environ- 
ments in which publish/subscribe mechanisms have 
been implemented Include high performance woricsta- 
tlons as subscriber computer systems with the broker 
running on a server computer to which each of the sub- 
scriber systems connects. The subscriber systems are 
thus typically homogeneous in terms of their system 
capabilities. The individual information requirements of 
users at each of the wori(stations have typically also 
been broadly consistent, since the broker's server sys- 
tem and the subscriber systems have typically been 
implemented within a single organisation for managing 
distribution of data to subscribers within the organisa- 
tion. It is very common for modem communications and 
data processing networics to comprise heterogeneous 
networks in which different systems have very different 
capabilities. Known publish/subscribe services do not 
take account of these differences. 

Summary of Invention 

[0007] In a first aspect of the present invention, 
there is provided a message broker software compo- 
nent supporting a publish/subscribe service wherein the 
broker is adapted to receive and store information from 
subscriber systems regarding their capabilities, and 
wherein the broker is responsive to receipt of a mes- 
sage from publisher application programs to modify the 
content of the message to conform to the subscriber 
system capabilities and then to send derived messages 
Including the modified message content to subscriber 
application programs at the subscril}er systems. 
[0008] Subscriber "systems" in this context may be 
any network-connected data processing or data presen- 
tation apparatus or device, such as a computer system, 
a personal digital assistant (PDA), a device controller 
embedded within an apparatus (such as in a vehicle), a 
mobile telephone or message service centre, or a 'set- 
top box' connected to a television set. 
[0009] The subscriber systems' capabilities are 
preferably characteristics which are relevant either to 
the presentation of message content at the subscriber 
system or to the processing or storing of the message 
content at the subscriber system. Examples of the 
capabilities of the subscriber system to process and to 
store message content include the processor speed 
and available storage space. Examples of capabilities 
relevant to presentation include: 

graphics capabilities such as the resolution or pixel 
depth of the display apparatus of the subscriber 
system, the graphics mode (type of display), the 
compression schemes supported, preferred Image 
format and other image model parameters; 
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• system limitations relevant to text documents such 
as a maximum document buffer size; 

audio capabilities, including whether the subscriber 
system supports audio playback and if so which for- 
mats; 

video capabilities, including whether the subscriber 
system supports video playback and which formats; 

characteristics which have requirements for busi- 
ness information (i.e. content-specific requirements 
as distinct from the more generic system limitations 
relevant to text documents described above) - these 
requirements may be data type conversions (such 
as text to voice for a mobile telephone); and 

• the application programs which are available on the 
system. 

[0010] Exposing these system characteristics or 
capabilities to the broker enables the broker to modify 
message content to conform to the capabilities. For 
example, a computer system within a vehicle may 
require transcoding of the text of received messages 
into audio, and may be unable to play video files, and 
the display screen may be low resolution. As another 
example, algorithms at the broker may select an 
abstract or generate a precis of an oversized document 
which exceeds a PDA's buffer limitations. 
[001 1] In addition to processing message content to 
conform to the above subscriber system capabilities, 
the broker will typically also perform any required text 
format conversions (such as ASCII to EBCDIC) and fil- 
tering of information in accordance with user-selection 
of required topics. These operations are known in the 
art. 

[0012] The invention has the advantage of enabling 
a publish/subscribe information distribution mechanism 
to be extended to handle any type of message content 
including multimedia content whilst enabling implemen- 
tation of this extended publish/subscribe capability 
within a heterogeneous network In which Individual sub- 
scriber systems may have very different capabilities. 
The invention provides a broker which can take account 
of the subscriber system capabilities and adapt the 
message content which it sends to the subscriber sys- 
tems to conform to the presentation capabilities or other 
capabilities of the individual subscriber systems. This 
may involve modifying both text and non-alphanumeric 
message content for prefen-ed presentation mecha- 
nisms and formats. 

[001 3] There are significant advantages of process- 
ing message content at the broker to conform to sub- 
scriber system capabilities instead of leaving subscriber 
systems to perform the processing. In a pervasive com- 
puting environment where subscriber systems include 
systems with very limited memory and processing 



power, it may be impossible to perform the necessary 
modifications of message content at the subscriber sys- 
tem. Secondly, the subscriber systems' experience of 
communication link performance can be improved by 
5 avoiding transmission of excessive sized messages 
which include more content than Is useful to the sub- 
scriber. 

[0014] A prefenred implementation of the collecting 
of subscriber system capabilities at the broker involves 
10 a message service agent at each subscriber system ini- 
tiating a query of the system capabilities In response to 
a user initiating a request for an application program on 
the subscriber system to be registered as a subscriber. 
This query is preferably only run when the first subscrip- 
ts tion registration request is initiated at each subscriber 
system. The system capability Information Is then pack- 
aged and sent to the broker together with user-specified 
requirements for information subjects. The broker then 
stores the capabilities information in association with 
20 the subscriber subject requirements. In response to 
published messages being received at the broker, the 
broker identifies the message content and uses the 
stored subject requirements to determine which sub- 
scribers have registered to receive this message con- 
25 tent and uses the stored capabilities information to 
determine what processing Is required to match the 
message content to subscriber system capabilities. 
[0015] The processing to conform to a subscriber 
system's capabilities may be performed subsequent to 
30 performing the steps of determining who the Interested 
subscribers are and comparing the content characteris- 
tics of a received message with the interested sub- 
scriber's system capabilities. Alternatively, a plurality of 
derived messages may be generated for received mes- 
35 sages having certain message content characteristics 
and then a check performed of which of the derived 
messages should be sent to each interested subscriber 
system. 

[001 6] The identification of message content prefer- 

40 ably includes an analysis to identify specific non-atpha- 
numeric data types within the message content, and 
this information is then used together with the sub- 
scriber capabilities information to determine the 
required processing by the broker. The user-specified 

45 requirements may also include an identification of par- 
ticular processing operations which should be per- 
formed by the broker or by selectable processing 
modules associated with the broker. The analysis of 
message content and user-specification of require- 

50 ments which identify required processing are described 
in the co-pending UK patent application number 
GB9906231.7 which Is assigned to International Busi- 
ness Machines Corporation. These features will be 
described in more detail later. 

55 [0017] In a second aspect of the invention there is 
provided a method, implemented at a message broker 
which is adapted to provide brokering services within a 
network, for modifying the content of messages 
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received from publisher application programs prior to 
sending messages to sut>scriber application programs, 
the method including: analysing the content of received 
messages to identify content characteristics; comparing 
identified content characteristics with stored information § 
regarding subscriber requirements to identify interested 
subscribers; comparing identified content characteris- 
tics with stored information regarding subscriber system 
capabilities to determine processing requirements for 
modifying the message content to conform to the sub- to 
scriber system capabilities; and sending to the sub- 
scriber systems derived messages including message 
content which conforms to the subscriber system capa- 
bilities. The processing to generate derived messages 
may be performed either before checking individual sys- 15 
tern capabilities (i.e. generating a set of versions of 
modified message content and only subsequently 
determining which of these messages should be sent to 
a subscriber) or afterwards (i.e. using the capability 
information to generate a derived message which con- 20 
forms). 

[0018] In a third aspect, there is provided a mes- 
sage service agent software component for a sub- 
scriber system of a data communications network which 
includes publisher application programs located on a 25 
first plurality of computers sending messages to a bro- 
ker on a server computer which then fbnwards mes- 
sages to the subscriber systems, the message service 
agent being adapted to: respond to initiation at the sub- 
scriber system of a subscription registration request, to 30 
query subscriber system capabilities; and to send to the 
broker system capability information obtained by the 
query together with the sut>scription registration 
request. 

[0019] In a fourth aspect of the present invention, 35 
there is provided a message broker supporting a pub- 
lish/subscribe service wherein the broker is adapted to 
identify characteristics of the content of messages 
received from publisher application programs and to 
use the identification of message content characteris- 40 
tics to determine which of a set of message handler 
modules to pass the message to. The selected mes- 
sage handler module processes the message and then 
one or more derived messages are sent to subscribers 
for which this message type or message content has 4s 
been requested. 

[0020] The function of Identifying message content 
characteristics and logic for determining which of a set 
of message handler modules to pass the message to, 
and preferably also the processing functions of one or 50 
more message handler modules, may be implemented 
within a replaceable software component which interop- 
erates with and controls the operation of a message 
broker. This separation of certain functions into a 
replaceable component (such as a Java Bean or 55 
ActiveX component) enables the behaviour of a mes- 
sage broker to be modified according to the particular 
software component which is used with the broker. 



[0021] The message broker may comprise a distrib- 
uted broker infrastructure, preferably with interoperating 
message broker software elements installed on each of 
a plurality of data processing systems. Messages are 
passed to different processing systems according to 
each system's capabilities and the identified message 
characteristics. For example, one processing system 
may be connected to a database to provide efficient 
SQL database access whereas a second system may 
have particular capabilities for image processing such 
as the ability to perform fast floating point operations. 
Thus, intelligent selection of message brokers of a dis- 
tributed broker service can be achieved using the 
present invention's identification of message content 
characteristics. 

[0022] In a fifth aspect, the invention provides a 
message broker supporting a publish/subscribe service 
wherein the broker is adapted to Identify non-alphanu- 
meric message content (i.e. content which Is not text- 
based or numerical or a digital or other encoded repre- 
sentation of text or numbers) within messages received 
from publisher application programs and to use the 
identification of non-alphanumeric message content to 
determine which of a number of message handler mod- 
ules to pass the message to. The selected message 
handler module processes the message and then a 
derived message is sent by the broker to subscribers for 
which this message type or message content has been 
requested. 

[0023] One or more of the selectable message han- 
dler modules are preferably multimedia processing 
modules, adapted to perform one or more specific 
processing operations on multimedia content within the 
message and then to pass one or more resulting modi- 
fied or derived messages back to the broker. Recogni- 
tion that a message contains non-alphanumeric content 
causes the broker to pass an incoming message to a 
processing module which is adapted to handle such 
content, such as a multimedia processing module. The 
processing module manipulates the content of the mes- 
sage and passes one or more derived messages back 
to the message broker, and It Is the broker which then 
sends the derived message to the relevant subscriber 
application programs. 

[0024] Thus, publish/subscribe capability has been 
extended by the present invention to support any type of 
message, including messages which include non- 
alphanumeric content such as multimedia content and 
to subscriber requirements which include a requirement 
for a message broker or associated message handler 
module to process the non-alphanumeric content. Prior 
to the present invention, identification and retrieval of 
any non-alphanumeric message content would have 
required a specific query to be constructed for each 
message scanned and so intelligent delivery of multi- 
media in a form required by subscribers could not be 
handled using known publish/subscribe mechanisms. 
[0025] According to one embodiment of the inven- 
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tlon, the message broker performs a determination of 
what particular operations are to be performed by the 
selected multimedia processing module according to 
specified requirements of subscribers and according to 
characteristics of the received message. Alternatively, a 
selected processing module may perform a standard 
set of operations regardless of subscriber requirements, 
with the broiler only subsequently checking which sub- 
scribers require which type of output. 
[0026] The non-alphanumeric content may be, for 
example, audio or image data or any binary data includ- 
ing executable program code. One example Is where a 
broker is connected to receive messages containing 
waveform data representing the material broadcast by a 
radio station. The broker recognises that the message 
includes audio content and passes the message to a 
particular message handler module which performs the 
following functions: retrieves a watermark from the 
audio data, compares the watermark with a database of 
copyright owners and their material to identify the rele- 
vant owner and material, and generates a message to 
be published to the copyright owner informing them that 
a particular copyright work has been broadcast by a 
particular radio station at a particular date and time. 
This will allow the copyright owner to seek royalty pay- 
ments. A second example is where an incoming mes- 
sage includes raw Image data captured by a satellite. 
The incoming message is processed to create sets of 
different resolution images, such that the appropriate 
resolution images are then available for delivery In 
response to different subscriber requests. Satellite 
images can also be processed in order that the pub- 
lished messages contain derived Information such as 
crop classification maps. 

[0027] Note that the above examples involve the 
processing modules significantly modifying or interpret- 
ing the received message content - the messages sub- 
sequently delivered to subscribers may include 
extracted, processed or derived data which is pertinent 
to the particular subscriber's requirements. There may 
also be examples where it is appropriate for messages 
sent to subscribers (refenred to herein as "derived" mes- 
sages) to Include identical content to the messages 
published by publishers, as determined by the message 
content characteristics and/or subscriber requirements. 
The invention's identification of multimedia content and 
consequent selection of an appropriate handler module 
is equally applicable to these latter cases. 
[0028] The message broker according to the pre- 
ferred embodiment of the invention is not limited to a 
single, specific message structure or to scanning of the 
contents of a particular field within a structured mes- 
sage, but is adapted to scan the content of a received 
message and to identify non-alphanumeric elements 
within the message. For example, the broker may be 
adapted to look for a topic string with certain topics hav- 
ing an expectation of non-alphanumeric content, and 
then to look for recognisable byte sequences which are 



representative of the structure of particular image for- 
mats (for example GIF, TIFF and JPEG files have a rec- 
ognisable initial byte sequence). 

5 Brief Description of Drawings 

[0029] Preferred embodiments of the invention will 
now be described In more detail, by way of example, 
with reference to the accompanying drawings In which: 

10 

Figure 1 1s a schematic representation of a compu- 
ter network implementing a first embodiment of the 
invention, including publisher application programs 
sending messages to a broker and subscriber 
15 application programs receiving messages from the 
broker; 

Figure 2 is a schematic representation of a distrib- 
uted broker architecture; and 

20 

Figure 3 shows the sequence of operations of sub- 
scription according to an embodiment of the inven- 
tion; 

25 Figure 4 shows the sequence of operations of a 

publish/subscribe distribution method implemented 
by a broker according to an embodiment of the 
invention. 

30 Figure 5 shows a linked list structure exemplifying a 
tabulated subscriber information storage structure 
within the broker of Figure 4. 

Detailed Description of Pref^n-ed Embodiments 

35 

[0030] The invention may be implemented in a het- 
erogeneous data processing and communications net- 
work. Application programs running on different 
computers within the network are enabled to communi- 

40 cate and hence interoperate using messaging products 
such as IBM Corporation's MQSeries message queuing 
software products. Message queuing and commercially 
available message queuing products are described in 
"Messaging and Queuing Using the MQr, B.BIakeley, 

45 H.Hams & R.Lewis, McGraw-Hill, 1994, and in the fol- 
lowing publications which are available from IBM Corpo- 
ration: "An Introduction to Messaging and Queuing"* 
(IBM Document number GC33-0805-00) and 
"MQSeries - Message Queue Interlace Technical 

50 Reference'' (IBM Document number SC33-0850-01 ). 
The network via which the computers communicate 
using message queuing may be the Internet, an 
intranet, or any computer networi^. IBM and MQSeries 
are trademarks of IBM Corporation. 

55 [0031] IBM's MQSeries messaging software prod- 
ucts provide transactional messaging support, synchro- 
nising messages within logical units of woric in 
accordance with a messaging protocol which gives 



9 



EP 1043 671 A2 



10 



assured once and once-only message delivery even in 
the event of system or communications failures. 
MQSeries products provide assured delivery by not 
finally deleting a message from storage on a sender 
system until it is confirmed as safely stored by a receiver 5 
system, and by use of sophisticated recovery facilities. 
Prior to commitment of transfer of the message upon 
confirmation of successful storage, both the deletion of 
the message from storage at the sender system and 
insertion into storage at the receiver system are kept in to 
doubt' and can be backed out atomically in the event of 
a failure. This message transmission protocol and the 
associated transactional concepts and recovery facili- 
ties are described in international patent application 
WO 95/1 0805 and US patent 5465328. which are incor- 15 
porated herein by reference. 

[0032] The message queuing inter-program com- 
munication support provided by the MQSeries products 
enables each application program to send messages to 
the input queue of any other target application program 20 
and each target application can asynchronously take 
these messages from its input queue for processing. 
This provides for assured delivery of messages 
between application programs which may be spread 
across a distributed heterogeneous computer network. 25 
but there can be great complexity in the map of possible 
interconnections between the application programs. 
[0033] This complexity can be greatly simplified by 
including within the network architecture a communica- 
tions hub to which alt other systems connect, instead of 30 
having direct connections between all systems. Mes- 
sage brokering capabilities can then be provided at the 
communications hub to provide intelligent message 
routing and integration of applications. IVIessage broker- 
ing functions typically include the ability to route mes- 35 
sages intelligently according to business rules and 
knowledge of different application programs' information 
requirements, using message 'topic* information con- 
tained in message headers, and the ability to transform 
message formats using knowledge of the message for- 40 
mat requirements of target applications or systems. 
Such brokering capabilities are provided by IBM Corpo- 
ration's MQSeries Integrator software product, provid- 
ing services for messages which are exchanged 
between application programs using IBM's MQSeries 45 
messaging products. 

[0034] If we adopt an hierarchical view of these var- 
ious components with application programs at the top 
closest to the user, then a message broker component 
sits at the next layer down between the application pro- 50 
grams and the message delivery software. 
[0035] According to the present invention, a mes- 
sage broker software product or software component 
can be adapted to support the publish/subscribe mech- 
anism for information delivery and to extend that mech- 55 
anism to include more complex processing of message 
content. 

[0036] Figure 1 shows an overview of message 



flows between publishing application programs 10 and 
subscribing application programs 20 via a message bro- 
ker 30 according to the present invention. The message 
broker 30 according to the prefon^ed embodiment of the 
invention is implemented as a software component 
which interoperates with a message delivery software 
product 90 running on a computer system 1 00 to which 
all publishing applications connect via a communica- 
tions network. However, the brokering functions could 
be distributed across a plurality of message broker com- 
ponents connected via the network, as discussed in the 
next paragraph. Thus, the message broker (whether a 
single component or distributed) is connected to receive 
all published messages. Each application program is 
associated with a message delivery product 90' which 
handles the complexities of inter-program communica- 
tion across the network, and each message broker is 
also associated with a message delivery product 90 on 
its computer system 100. 

[0037] As noted above, and represented in Figure 
2, instead of a single broker at the communications hub, 
the hub may consist of a distributed collection 35 of bro- 
kers 30 which communicate with each ottier. These bro- 
kers 30 may share subscription lists and provide a 
redundant, scalable distributed broker infi^astructure. 
The message broker services provided by such a dis- 
tributed broker infrasti'ucture can be enhanced by using 
an identification of message content characteristics to 
select one of a plurality of processors within a distiib- 
uted broker environment with reference to its functional 
capabilities and the identified message characteristics 
such that processing of tiie message is optimised. For 
example, if one part of the message broker function is 
located on a system having a direct connection to a 
database, the efficiency of SQL queries may be opti- 
mised by passing all SQL queries which are issued 
within a region of the networic to that system. If a second 
part of the message broker function is located on a sys- 
tem which has specialised processing capabilities for 
Images, then all messages witiiin tiie relevant network 
region containing images may be sent to that system. 
[0038] For ease of understanding, the following 
description will refer to a single message broker located 
on a single data processing system to which all puth 
lished messages are sent. 

[0039] A user of a subscriber application initiates 
130 a request to register the subscriber application with 
the broker, the request including information entered by 
the user regarding the subjects for which the user 
wishes to receive messages. For example, the user can 
specify "STOCK/COMPUTERS/*" for stock prices of all 
computer companies, or "VIBM" for all messages which 
relate to IBM Corporation, or can specify a requirement 
to receive all messages firom a particular subscriber. 
The request to register is generated at the subscriber 
system and this triggers a message service agent 120 
(a software component) at the subscriber system to 
query 140 a predefined set of the subscriber system's 
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capabilities via a system level API. This can be a single 
API call which pronfipts retrieval of a comprehensive 
predefined set of system capability infonnation. In this 
case, the query is only run when a first subscription reg- 
istration request is initiated by a given subscriber sys- 
tem, unless the capability information being obtained by 
the query includes information which changes over time 
(such as the amount of memory space currently availa- 
ble for use by the subscriber application). 
[0040] Alternatively, the message service agent 
may be adapted to issue a number of specific API calls, 
one for each required element of capability information, 
in dependence on the type of subscriber request. If the 
capability information queried is specific to the particu- 
lar subscription request, then the information may need 
to be supplemented and/or updated for subsequent 
subscription requests. With this embodiment, no query 
may be run at all for certain types of subscription 
request. 

[0041] The retrieved capabilities information and 
the existing generated request are then packaged by 
the message service agent 120 as an XML (Extensible 
Markup Language) structured message and sent 150 to 
the broker. 

[0042] On receipt of a subscriber registration 
request, the broker maps 160 the system capability 
information included in the request to predefined 
classes of capability. For example, the maximum image 
resolution capability of a subscriber system may be 
256x128 pixels. The broker then maps this to one of the 
classes High, Medium or Low resolution. A similar map- 
ping to a predefined set of classes is performed for pixel 
depth, buffer sizes, and other parameters which can be 
categorized in this way. This ensures that the different 
types of processing which have to be performed for 
each different subscriber are limited to avoid the need to 
control unique processing operations In relation to each 
capability parameter for each different subscriber - pro- 
viding a balance between performance and flexibility. 
[0043] The broker then stores 170 the subject 
requirements of subscribers in tables of a database in a 
repository 1 1 0 in association with their categorised sys- 
tem capability information. Figure 5 shows an example 
table structure in which a first table 300 stores required 
subject information 320 in association with subscriber 
names 330 (user names). A second table 310 stores the 
subscriber names 330 together with the networic 
address 340 of the subscriber application programs, the 
subscriber system capability information 350 and infor- 
mation 360 regarding any processing requirements 
specified by the user or required by the application pro- 
gram. 

[0044] When a message is received by the mes- 
sage broker, a message analysis component 60 within 
the broker firstly investigates 180 the message header 
for a topic or subject identifier and/or identification of the 
sender, as is known in the art. A message publisher 
application may have identified a message topic or sub- 



ject, such as by entering "STOCK/COMPUTERS/IBM" 
or "STOCK/AUTOS/GM" into a relevant field of the mes- 
sage header to indicate that the subject of the message 
is stock prices for IBM Corporation or General Motors 

5 respectively. The message broker includes a rules 
engine 40 which compares 210 any topic identifier and 
sender identifier retrieved from the message header of 
a published message with a list of subscribers' informa- 
tion requirements to identify which subscribers wish to 

10 receive this message. 

[0045] Additionally, according to a preferred 
embodiment of the invention, the topic identifier is also 
compared 210 with a list of topic identifiers which are 
known to imply that there is non-alphanumeric message 

15 content. In one example implementation, message pub- 
lisher applications provide information to the broker (as 
part of a registration process) including a list of topic 
identifier codes which are to be used when the publisher 
is publishing messages including non-alphanumeric 

20 content. (Alternatively, the registration process could 
involve publishers specifying message topics which will 
only include text-based content, to indicate that no fur- 
ther analysis of the content of such messages is 
required to determine whether there is any non-textual 

25 content.) 

[0046] Then, unless the attempt to retrieve and 
interpret a message topic identifier enables a final 
determination of how to handle the message, the mes- 
sage broker's analysis component 60 scans 190 the 

30 content of the message, looking for byte sequences 
which are representative of particular content types. 
The analysis component 60 compares 200 byte 
sequences from the message content with dictionaries 
of byte sequences which are representative of certain 

35 content-type-spedfic structures. For example. GIF, TIFF 
and JPEG images each have characteristic byte struc- 
tures. The content scanning step can identify audio con- 
tent and images and resolve image formats. A message 
could also include program code, such as HTML 

40 (Hypertext Markup Language) or XML codes which 
describe the fomiat of data for display by an Internet 
Web browser. The message content scanning step 
could easily be implemented to recognise XML or HTML 
by reference to their respective dictionary of tags. 

45 [0047] Next, having identified published message 
content 200 and identified interested subscribers 210, 
the system capabilities information and other sub- 
scriber-specific and application-specific information is 
retrieved from the subscriber information tables. The 

50 rules engine 40 then compares 220,230 the identified 
message content with the stored subscriber application 
requirements and system capabilities of the identified 
interested subscribers and any stored information 
regarding particular processing which the user has 

55 requested. For example, an incoming message may 
contain an image file having a particular resolution such 
as 1280 X 1024, which the rules engine determines 
exceeds the 'Medium' resolution capability of a particu- 
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lar subsaiber system. The rules engine thus identifies a 
requirement to convert the Image to 'Medium' resolution 
(640x512). Similar requirements for processing are 
identified for other capability parameters. 
[0048] The broker now has sufficient information to 
generate 240 a processing task definition regarding the 
processing which must be performed to generate a 
derived message content which is to be sent to the sub- 
scriber application and which content will conform to the 
following requirements: 

matching the subject matter requirements of the 
user; 

- matching the system capabilities of the subscriber 
system; and 

- matching the format requirements of the subscriber 

application. 

[0049] The processing task definition is then used 
to select 250,260 and control the operation of a particu- 
lar processing module 70, or a plurality of processing 
modules, of the broker to perform 270 the required 
processing. A collection of message handler modules 
70, including the multimedia processing modules, are 
held as objects In an object library 80 and are instantia- 
ble in response to the broker's determination of a 
requirement for a specific processing module 70 to per- 
form its processing operations on a received message. 
The message broker 30 responds to a selection 250 of 
a message handler module 70 by checking whether that 
module is already running (i.e. has been instantiated on 
receipt of a previous message) and, if so, queuing the 
message in an input buffer for processing by that han- 
dler module. If the required handler module is not run- 
ning, then it is instantiated and the message Is passed 
to the input buffer of the handler module for processing. 
[0050] in one embodiment of the invention, all mes- 
sages which include image content and for which some 
processing of that content is required are passed 260 to 
a single image processor module. Similarly, all mes- 
sages including audio data which requires processing 
are passed to a single audio processing module and 
messages including other identifiable data types are 
passed to a respective processing module for that type. 
This may entail passing a single message through a 
sequence of processing steps each performed by a dif- 
fierent message handler module if the message includes 
a number of content types. Messages containing exe- 
cutable code may be passed to a processing module 
that performs authentication (for example, digital signa- 
ture verification or virus checking) prior to permitting 
publication, to protect subscribers from undesirable pro- 
gram code. 

[0051] The processing which is performed by the 
selected message handler module is determined by the 
specified requirements of individual subscriber applica- 
tions, the particular characteristics of the received mes- 
sage content, and the capabilities of the subscriber 



system. Using the example of XML message content 
given earlier, examination of the XML content could indi- 
cate the existence of specific multimedia content and 
reveal the characteristics of the content that would be 

5 relevant to a determination of how it should be proc- 
essed, and in some circumstances XML format could 
then also be used to provide one or more code modules 
that would be required for appropriately processing the 
identified content. 

10 [0052] The selected message handler module then 
performs 270 operations on the message content in 
accordance with the associated task definition, and 
passes the resulting derived message or messages 
back to the message broker for distribution 280 to the 

15 relevant subscribers. 

[0053] There Is a potential performance problem 
with the above described implementation of the inven- 
tion in that, even with the above-described classification 
of capability parameters Into predefined classes (such 

20 as High, Medium and Low), there is still potentially a dif- 
ferent set of processing operations being performed on 
each published message for each subscribing user and 
this has performance overheads. An alternative embod- 
iment of the invention generates multiple versions of the 

25 content of some messages (for example, where the 
message includes an image file having a resolution 
above a minimum threshold, generating High, Medium 
and Low resolution images) and only subsequently 
refers to individual system capabilities and application 

30 and user-requirements to determine which version of 
the image or other content should be sent to which sub- 
scribers. This can have significant performance bene- 
fits, especially if the generation of the multiple versions 
of message content is performed in parallel with 

35 retrieval of the system capability and other relevant 
Information. 

[0054] Furthermore, performance can be optimised 
for the particular network environment by selecting 
which message content characteristics should be modi- 

40 fied by processing prior to a comparison with subscriber 
capabilities and requirements and which characteristics 
should only be modified with reference to individual sub- 
scriber capabilities and requirements. An example of 
this is to set a flag when there is at least one active sub- 

45 scriber application which requires transcoding of text to 
audio, and if the flag is set to perform such transcoding 
to generate an audio file prior to comparing an individ- 
ual message's content with individual subscriber 
requirements. This can avoid having to replicate the 

50 transcoding step as part of the personalised processing 
for particular subscribers. More generally, flags can be 
set at the broker to Indicate whether there are active 
subscriber systems having particular system capabili- 
ties and to generate different versions of message con- 

55 tent in accordance with the subscriber system 
capabilities of the active subscribers prior to checking 
message content against the system capabilities of spe- 
cific interested subscribers. 
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[0055] The above examples of alternative embodi- 
ments of the invention show that there is considerable 
fl^ibility in relation to the sequence of events. There is 
similar flexibility in relation to the architecture of the bro- 
ker, since particular operations such as transcoding 
could be performed by a separate transcoding service 
or at various points within the message flow. 
[0056] The advantages of the present invention will 
now be described with reference to a number of exam- 
ples of message processing and brokering according to 
the invention. 

[0057] A first example is where there are limited 
storage capabilities at a subscriber system, whereas 
the broker is located on a server system running a data- 
base. It may be appropriate to send to a subscriber 
application program a pointer which will enable a user of 
the subscriber application to access the database via 
the broker when required without having to replicate 
records of the database onto the subscriber system 
unnecessarily The message sent to the subscriber by 
the broker in response to a published message aniving 
from a publisher may include a SQL expression for sub- 
sequently querying the database on the broker's server, 
or it may include a URL for accessing a Web page via 
the Internet or an intranet. It should be note that a bro- 
ker for an Internet Web server may have a great range 
of different client systems sending requests to it, and so 
the present invention's ability to take account of sub- 
scriber system capabilities is very relevant to the Inter- 
net environment. 

[0058] A second example use of the invention is for 
reducing the size of text documents to conform to the 
available text buffer size of the subscriber system. 
SGML and XML are example languages tor describing 
information content which allow an intelligent mark-up 
of sections of a document (abstract, title, figures, 
authors, etc) so that it is possible to request that only 
abstracts, for example, of documents be delivered to 
your subscriber application If you know the vocabulary 
of the tagged documents. If only lOkb is available for 
storing a text document, the subscription may require 
processing of larger documents to reduce them to lOkb 
or less. As an alternative to selection of the abstract, the 
broker may call algorithms which perform more complex 
text processing such as generating a precis. 
[0059] For a third example, consider a computer 
system which is set up to publish messages including 
images of the surface of the earth captured by an orbit- 
ing satellite. The organisation obtaining the information 
is able to recoup some of the costs of retrieving and 
processing their data by offering a subscription service 
to other organisations interested in receiving particular 
data contained within the captured images. A publisher 
application program at this first system periodically gen- 
erates messages containing the raw captured images 
and sends these messages to the message broker. The 
message broker analyses 180-200 the messages on 
receipt and identifies the presence of images within the 



messages (by the combination of comparing any recog- 
nised message topic identifier with known topics and 
scanning message content, as described above). 
[0060] The message broker uses the topic identifier 

5 to check 210 records of subscriber requirements - 
determining which subscribers have registered a 
requirement to receive messages related to this topic 
and determining what specific data requirements have 
been specified by those subscribers. As an example, a 

10 first subscriber may have registered with the message 
broker with a requirement for the subscriber to be set a 
message whenever the message broker receives mes- 
sages including a topic identifier naming the specific 
publisher organisation. The subscriber may also have 

15 specified that all received messages from that publisher 
organisation should have their image content proc- 
essed to generate a derived image showing classifica- 
tions of crops identifiable from the captured raw image. 
Furthermore, the subscriber system's capabilities may 

20 impose requirements for particular image processing 
operations. One example Is the pixel depth of the sub- 
scriber system display, which may be bilevel, 256 level 
greyscale, or 24 bit colour - mapped by the broker to 
one of a set of pixel depth classes (High, Medium or 

25 Low). Having recognised the presence of image con- 
tent, the message broker accesses the system capabil- 
ities Information and user-specified requirements and 
then generates a task definition which describes the 
operations to be performed to process the received 

30 message's image content in the required way. 

[0061] The received message and the processing 
task definition are then sent 260 to a suitable image 
processing message handler module. This message 
handler module performs 270 the defined sequence of 

35 processing operations and then passes back to the 
message broker a derived message which includes the 
modified image data. 

[0062] The message broker then fonwards 280 the 
derived message to the subscriber or subscribers who 

40 specified this information requirement. This routing of 
the final derived message or messages to subscribers 
in accordance with subscriber's Information require- 
ments (under the control of the rules engine 40) is very 
similar to the routing performed by prior art message 

45 brokers. Thus, the present invention enables content- 
type-specific processing of message content in accord- 
ance with subscriber system capabilities such that the 
content of messages sent to subscribers matches their 
system capabilities as well as the user-specified and 

50 application-Specific requirements. The publish/sub- 
scribe broker according to the invention also provides 
the ability to perfonm appropriate processing of a mes- 
sage without relying on the inclusion of a single specific 
message structure. These advantages are provided 

55 within the context of an otherwise conventional pub- 
lish/subscribe system which uses business logic for 
Identification of relevant subscribers. 
[0063] In the preferred embodiment of the inven- 
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tion, the message content type and system capabilities 
can be used to select a message handler module with- 
out reliance on a topic identifier in the message header. 
The analysis component 60 of the message broker 
scans message content (and this can be done either in 5 
alt cases or only if querying of the message header is 
not sufficient to determine which message handler 
module to select). In one specific example implementa- 
tion, when an image is identified within the message 
content by this scanning step, the image can undergo a 10 
watermark extraction process followed by an analysis of 
the watermark to determine whether this includes infor- 
mation sufficient to identify what message handler mod- 
ule should be selected. This watermark together with 
the subscriber system capabilities and stored sub- 15 
scriber requirements determine what specific process- 
ing operations should be performed on the image. 
[0064] Similar watermark extraction and analysis to 
determine required processing operations can also be 
implemented for audio data within a message. One 20 
example is where a broker is connected to receive mes- 
sages containing waveform data representing the mate- 
rial broadcast by a radio station. The broker recognises 
that the message includes audio content and passes 
the message to a particular message handler module 25 
which performs the following functions: retrieves a 
watermark from the audio data, compares the water- 
mark with a database of copyright owners and their 
material to identify the relevant owner and material, and 
generates a message to be published to the copyright 30 
owner informing them that a particular copyright work 
has been broadcast by a particular radio station at a 
particular date and time. This will allow the copyright 
owner to seek royalty payments. The subscribing user 
may have specified, or the subscriber application may 35 
require, an output type such as an e-mail memo and so 
specific processing may be required to significantly 
transform the message into the required type. 
[0065] The message broker according to the pre- 
ferred embodiment of the invention is able to employ 40 
logic which relies on an analysis of text-based informa- 
tion in a message header and text-based data within the 
content of a message to determine whether a particular 
subscriber should receive a message in response to 
receipt by the broker of an incoming message. That is. 45 
in addition to the multimedia processing capability of 
apparatus implementing the invention, the processing 
performed by a broker according to the invention may 
include processing subscriber information requirements 
such as a requirement to be notified of the stock price of so 
a company (COMPANYNAME) when the stock price 
exceeds a tiireshold price. The subscriber may specify 
"STOCK/COMPUTERS/COMPANYNAME- as a topic 
identifier to be identified from message headers and 
"PRICE: >$1 50" as a filter which requires scanning of a 55 
share price field within the content of a received mes- 
sage to determine whether a message should be sent 
to that subscriber. The content analysis can also be 



implemented by queries which use the Structured 
Query Language notation or similar to test whetiier cer- 
tain conditions are true (for example, "the value of COM- 
PANYNAME stock is > $150"). 
[0066] In a further example of operation of a mes- 
sage broker implementing the invention, content que- 
ries may involve multimedia content analysis following 
on from the multimedia identification described above. 
This goes far beyond the simple business logic which is 
known in prior art systems. For example, subscribers 
may register an interest in receiving any images which 
include a red circle towards the top right of the image as 
a way of investigating possible trade mark Infiringement, 
or register an Interest in receiving any image which Is 
sufficiently similar to an example image or set of images 
which are representative of certain classes of image. 
[0067] Technology is available (for example, IBM's 
Query By Image Content (QBIC) technology described 
in US patent 5579471 ) which allows image matching in 
response to such example images or user descriptions 
of features of an image tiiat tiiey want to match. The 
incoming image would be run as a query against a 
QBIC catalogue comprising the class representatives, 
the results being a set of distance measures. Fuzzy 
matching algorithms establish how close an image 
within a message is to the examples images or 
requested features. The subscription registration mech- 
anism in a traditional publish/subscribe broker can tiius 
be enhanced using multimedia feature definitions and 
multimedia feature comparisons to allow only tiiose 
messages that meet the subscription criteria (within a 
given tolerance defined by specific thresholds) to be 
delivered to the subscribing user. 
[0068] The QBIC example shows that tiie identifica- 
tion of message content characteristics can comprise 
separate steps of recognising the existence of a certain 
message content type (typically using precise matching 
of a given byte sequence) and then identifying image 
content characteristics beyond those which are inherent 
in tiie recognised byte sequence (such as whetiier the 
content matches defined parameters or example 
images within certain tolerances). 
[0069] Following these steps, a decision is made 
with reference to subscriber system capabilities as to 
what processing to perform and how to do it. There is 
considerable flexibility within the scope of tiie invention 
regarding how to implement this step. The decision can 
employ a stored and relatively inflexible method or a 
dynamic approach that relies on periodic input from 
both publisher and subscriber The subscriber may 
specify when registering as a subscriber some particu- 
lar processing operations that it requires to be per- 
formed on messages to generate derived messages. 
Examples of this were described eariier. 
[0070] The function of identifying message content 
characteristics and the logic for determining which of a 
set of message handler modules to pass the message 
to and for generating a task definition for controlling the 
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operation of the message handler modules (and prefier- 
abiy also the actual processing functions of one or more 
message handler modules) may be implemented within 
a replaceable software component which Interoperates 
with and controls the operation of a message broker. $ 
This separation of certain functions into a replaceable 
component (such as a Java Bean or ActiveX compo- 
nent) enables the behaviour of a message broker to be 
modified according to the particular software compo- 
nent which is used with the broker. It also enables new 
capabilities to be added over time, such as by loading 
new message feature dictionaries into an object library 
when they become available and thereby enabling the 
message broker to recognise new message content 
types. 

[0071] It will be clear to a person having ordinary 
skill in the art that a number of modifications to the 
above described embodiments of the invention can be 
made within the scope of the invention. For example, 
the querying of a subscriber system's capabilities may 
be triggered by a request sent from the broker after the 
broker receives a request for subscriber registration 
instead of before, although it is preferred not to overload 
the broker with unnecessary processing. The querying 
of system capabilities may be implemented by a proc- 
ess implemented within the messaging manager as an 
alternative to a specialised separate message service 
agent Furthermore, the above description refers to a 
number of different examples of capability information. 
Particular implementations of the Invention may use 
only a single capability characteristic, and other imple- 
mentations may take account of any number of different 
capabilities. 

[0072] In one embodiment, the broker is provided 
with stored information of system capabilities for a 
number of types of subscriber system, and a subscriber 
system merely Includes with a subscription request an 
identifier of its system type. This is sufficient for the bro- 
ker to identify which of the stored capabilities informa- 
tion applies to particular subscriber systems. 

Claims 

1. A message broker for providing brokering services 
for a publish/subscribe distribution service within a 45 
communications network, wherein the broker is 
adapted to receive and save information firom sub- 
scriber systems within the networic regarding their 
system capabilities, and wherein the broker is 
responsive to receipt of messages from publisher so 
application programs within the networtt to modify 

the content of the messages to conform to the sub- 
scriber system capabilities and then to send 
derived messages including the modified message 
content to subscriber application programs at the ss 
subscriber systems. 

2. A message broker according to claim 1 , wherein 



the message broker includes components for per- 
forming the following operations: 

analyzing messages received from publisher 
application programs to identify characteristics 
of the message content; 



accessing information regarding subscriber 
requirements and subscriber system capabili- 
ties; 

comparing message content characteristics 
with tiie subscriber requirements information to 
identify Interested subscribers; 

comparing message content characteristics 
with the capabilities information for subscriber 
systems of interested subscribers to determine 
which of the plurality of derived messages 
should be sent to each Interested subscriber; 
and 

sending to interested subscribers a selected 
one of the derived messages which conforms 



accessing information regarding subscriber 
requirements and subscriber system capabili- 

10 ties; 

comparing message content characteristics 
witii the subscriber requirements information to 
identify interested subscribers; 

15 

comparing message content characteristics 
with the capabilities information for subscriber 
systems of interested subscribers to determine 
whether they conform; 

20 

responsive to determining that the message 
content characteristics do not conform to a 
subscriber system's capabilities, modifying the 
message content to conform to the subscriber 
25 system capabilities and sending to a sub- 

scriber application program at the subscriber 
system a derived message including the modi- 
fied message content. 

30 3. A message broker according to claim 1 , wherein 
the message broker includes components for per- 
forming the following operations in response to 
receipt of a message from a publisher application 
program: 

35 

analyzing the message to identify characteris- 
tics of the message content; 

generating a plurality of derived messages hav- 
40 ing modified message content; 
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to the capabilities of the respective subscriber 
system. 

4. A message broker according to any one of the pre- 
ceding claims wherein the subscriber system capa- 
bilities information stored by the broker includes 
image and/or video presentation capabilities. 

5. A message broker according to any one of the pre- 
ceding claims wherein the subscriber system capa- 
bilities information stored by the broker Includes 
audio playback capabilities. 

6. A message broker according to any one of the pre- 
ceding claims, including means for analyzing byte 
sequences of the content of messages received 
from publisher application programs to identify 
structures representative of particular types of mul- 
timedia message content. 

7. A distributed message broker infrastructure com- 
prising a plurality of message broker components 
according to claim 1 , each for running on one of a 
plurality of interconnected computer systems, each 
message broker software component having 
access to information regarding the capabilities of 
each of said plurality of interconnected computer 
systems and each message broker component 
being adapted to select, in accordance with identifi- 
cation of message content characteristics of 
received messages and in accordance with the 
capabilities of the plurality of interconnected com- 
puter systems, one of said computer systems for 
processing a message. 

8. A method, implemented by a message broker 
which is adapted to provide brokering services 
within a network, for modifying the content of mes- 
sages received from publisher application pro- 
grams prior to sending messages to subscriber 
application programs, the method including: 

analysing the content of received messages to 
identify content characteristics; 

comparing identified content characteristics 
with stored information regarding subscriber 
requirements to Identify Interested subscribers; 

comparing identified content characteristics 
with stored information regarding subscriber 
system capabilities to determine processing 
requirements for modifying the message con- 
tent to conform to the subscriber system capa- 
bilities; 

generating one or more derived messages 
Including modified message content; and 



sending to the subscriber systems derived 
messages including message content which 
conforms to the subscriber system capabilities. 

5 9. A method according to claim 8, including generat- 
ing a plurality of derived versions of the received 
message content prior to the step of comparing 
identified content characteristics with subscriber 
requirements, wherein the determination of 

10 processing requirements is used for selection of 
one of the plurality of derived content versions for 
sending to a subscriber system. 

10. A method according to claim 8, wherein the step of 
15 generating one or more derived messages is per- 
formed subsequent to and in accordance with the 
determination of processing requirements such that 
the broker generates derived messages which con- 
form to the subscriber system capabilities. 

20 

11. A computer program for controlling the operation of 
a computing device on which it is run, to execute 
processes to perfomn the following operations: 

25 in response to Initiation, by a user of the device, 

of a request for an application program on the 
device to subscribe to a publish/subscribe dis- 
tribution service such that the subscriber appli- 
cation program will receive messages from a 

30 message broker which provides brokering 

services for the publish/subscribe distribution 
service, querying the system capabilities of the 
computing device; and 

35 sending the capabilities information for the 

computing device to the message broker in 
association with the subscription request. 

12. A data processing system for connection within a 
40 communications networic, the system including a 

message broker according to claim 1 for providing 
brokering services for a publish/subscribe distribu- 
tion service. 

45 13. A distributed data processing apparatus comprising 
a first set of data processing systems having 
installed thereon application programs adapted for 
publishing messages, the first set of data process- 
ing systems being connected for sending published 

50 messages to a communications hub having mes- 
sage broker software installed thereon, the commu- 
nications hub being connected to a second set of 
data processing systems having Installed thereon 
application programs adapted to subscribe to the 

55 message broker software as subscriber application 
programs of a publish/subscribe distribution serv- 
ice, wherein the message broker software com- 
prises a message broker according to claim 1 . 
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14. A message broker for supporting a publish/sub- 
scribe service by receiving messages from pub- 
lisher application programs and then fbnA^rding 
messages to subscriber application programs in 
accordance with subscriber requirements, wherein $ 
the message broker component includes: 

a message analysis component for Identifying 
non-alphanumeric message content within 
messages received from publisher application io 
programs; 

a selector for using the identification of non- 
alphanumeric message content to detenmine 
which of a set of message handler components is 
to pass the message to; 

a set of selectable message handler compo- 
nents for processing received messages to 
generate one or more derived messages; 20 

means for accessing a repository storing 
requirements of subscriber application pro- 
grams; and 

25 

means, responsive to requirements of sub- 
scriber application programs stored in said 
repository, for passing derived messages to 
subscriber application programs which require 
the derived messages. 30 

15. A message broker according to claim 14, which is 
adapted to access the repository of subscriber 
requirements to determine what processing opera- 
tions are to be performed on received messages by 35 
the message handler components. 

16. A message broker according to claim 14 or daim 
15, wherein the selector is adapted to select one of 
the set of message handler modules in response to 40 
an Identification by the message analysis compo- 
nent of the type of non-alphanumeric message con- 
tent within a received message. 

17. A message broker according to daim 16, wherein 45 
the means for Identifying non-alphanumeric mes- 
sage content includes means for analysing byte 
sequences within the message content to identify 
structures representative of particular types of mul- 
timedia message content. 50 

18. A message broker according to any one of daims 
14 to 17. wherein the message analysis component 
Is adapted to implement a watermark retrieval oper- 
ation for received messages and to analyse 55 
retrieved watermarks to identify information con- 
tained therein which facilitates a determination of 
which of the set of message handler components to 
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pass the message to. 

19. A message broker according to any one of claims 
14 to 17, wherein one or more of said message 
handler modules is adapted to implement a water- 
mark retrieval operation for messages passed to 
that message handler module and to analyse 
retrieved watermarks to Identify information con- 
tained therein which facilitates a determination of 
what processing operations are to be performed on 
the message by the message handler module. 

20. A process controller component for a message bro- 
ker according to daim 14, wherein the process con- 
troller component indudes the message analysis 
component and the message handler component 
selector. 

21. A message broker software component supporting 
a publish/subscribe service, for controlling the oper- 
ation of a computer on which it runs to perform the 
following steps: 

identifying non-alphanumeric content of mes- 
sages received from publisher application pro- 
grams; 

using the identification of non-alphanumeric 
message content to determine which of a set of 
message handler modules to pass the mes- 
sage to; 

processing the message using the selected 
message handler module to generate one or 
more derived messages; and 

sending a derived message to subscribers 
which require the derived messages, as deter- 
mined by stored subscriber requirements. 

22. A distributed message broker infrastructure com- 
prising a plurality of message broker software com- 
ponents according to daim 21, each for running on 
one of a plurality of interconnected computer sys- 
tems, each message broker software component 
having access to information regarding the capabil- 
ities of each of the plurality of Interconnected com- 
puter systems and each message broker software 
component being adapted to select, in accordance 
with the identification of non-alphanumeric content 
of received messages and in accordance with the 
capabilities of the plurality of interconnected com- 
puter systems, one of said computer systems for 
processing a message. 

23. A method, implemented by a message broker 
within a communications networic, of processing 
messages received by the message broker from 
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publisher application programs and publishing mes- 
sages to subscriber application programs in 
response to said received messages, the method 
including the steps of: 

5 

analysing received messages to identify non- 
alphanumeric message content; 

selecting, using the identification of non-alpha- 
numeric message content, one of a set of mes- io 
sage handler components to pass each 
received message to; 

passing each received message to the 
selected message handler module; is 

performing the steps of retrieving information 
requirements of subscriber application pro- 
grams from a repository storing said informa- 
tion requirements and processing the received 20 
messages using the selected message handler 
modules; and 

responsive to the retrieved information require- 
ments of subscriber application programs, 25 
passing processed messages to subscriber 
application programs which have a require- 
ment to receive the processed messages. 

24. A computer system including a message brolcer 30 
component for supporting a publish/subscribe serv- 
ice on behalf of application programs connected 
thereto via a communications network, wherein the 
message broker component includes: 

35 

a message analysis component for identifying 
non-alphanumeric message content within 
messages received from publisher application 

programs; 

40 

a selector for using the identification of non- 
alphanumeric message content to determine 
which of a set of message handler modules to 
pass the message to; 

45 

a set of selectable message handier compo- 
nents for processing received messages to 
generate one or more derived messages; 

a repository for storing requirements of sub- so 
scriber application programs; and 

means, responsive to requirements of sub- 
scriber application programs stored in said 
repository, for passing derived messages to 55 
subscriber application programs which require 
the derived messages. 



14 



EP 1 043 671 A2 





16 



EP 1 043 671 A2 



SUBSCRtBER INITIATES SUBSCRIPTION REQUEST 130 

r ^ 

MESSAGE SER VICE AGENT QUERIES CAPABILITIES -KO 

REQUEST AND CAPABILIT IES ARE SENT TP BROKER KiSO 

T 

BROKER MAPS CAiVSBILITIES TO PREDEFINB)CLA^ ES>^160 

T 



BROKER STORES SUBSCRIPTION REQUIREMENTS 
WITH CAPABILITY CUSSES 



-170 



FIG. 3 



17 



EP 1 043 671 A2 



ANALYZE 
RECEIVED 
MESSAGE 



CHECK TOPIC IDENTIFIER 180 

190 



I 



ANALYZE CONTENT BYTE 
SEQUENCES 



i 



DETERMINE 
SUBSCRIBERS 
AND REQUIRED 
PROCESSING 



COMPARE WITH DISTRIBUTION 
LIST TO IDENTIFY SUBSCRIBERS 



I 



210 



COMPARE WITH SYSTEM CAR^BILITIES | — 220 
♦ — — i 



DETERMINE OTHER SUBSCRIBER 
REQUIREMENTS 

I 



.230 





COMPARE WITH DICTIONARIES 
TO IDENTIFY CONTENT 


— 200 















GENERATE PROCESSING TASK DEFINITION | — ^240 



SELECT HANDLER MODULE 

I 



•250 



FORWARD MESSAGE CONTENT TO HANDLER MODULE 1^260 





GENERATE DERIVED MESSAGES h ^270 



SEND MESSAGES TO SUBSCRIBER APPLICATIONS | 280 



FI6.^ 



18 



EP 1 043 671 A2 



330 



320 



300, 



SUBSCRIBER 
NAME 


TOPICS 























310, 



SUBSCRIBER 
NAME 


DESTINATION 
APPLICATION ADDRESS 


SYSTEM 
CAPABILITIES 


SUBSCRIBER 
REQUIRMENTS 











































FIG. 5 



19 



(19) 



J 



(12) 



Europaisches Patentamt 
European Patent Office 
Office europ^en des brevets 



(88) Date of publication A3: 

16.07.2003 Builetin 2003/29 



(11) EP 1 043 671 A3 

EUROPEAN PATENT APPLICATION 

(51) intci.7: G06F 17/60. G06F 9/46 



(43) Date of publication A2: 

11.10.2000 Buiietin 2000/41 

(21) Application number: 00302178.9 

(22) Date of filing: 17.03.2000 



(84) Designated Contracting States: 


(72) Inventors: 


AT BE CH CY DE DK ES Fl FR GB GR iE iT LI LU 


• Bird, Colin Leonard 


MC NL PT SE 


Winchester, Hannpshire S021 2JN (GB) 


Designated Extension States: 


• ibbotson, John Brian, iBIM UK Ltd., IPL 


AL LT LV IMK RO Si 


Winchester, Hampshire S021 2JN (GB) 


(30) Priority: 19.03.1999 GB 9906231 


(74) Representative: Jennings, IVIichael John 


04.06.1999 GB 9912901 


iBiM United Kingdom Limited, 




intellectual Property Department, 


(71) Applicant: international Business IMacliines 


Hursley Parle 


Corporation 


Winchester, Hampshire S021 2JN (GB) 


Armonlc, NY 10504 (US) 





(54) Message broker providing a publish/subscribe service and method of processing messages 
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(57) Provided is a message broker and a method of 
processing message content within a publish/subscribe 
message distribution service. The message broker and 

method extends publish/subscribe service capability to 
heterogeneous communications networks in which dif- 
ferent subscriber systems can have very different capa- 
bilities in terms of their multimedia presentation capabil- 
ities, processing capabilities and storage capabilities. 
The broker modifies the content of messages received 
from publisher application programs to conform to the 



subscriber system capabilities prior to distribution to 
subscriber applications. The broker is adapted to iden- 
tify characteristics of the content of received messages 
and to use this information to determine which of a set 
of message handier modules to pass the message to. 
Derived messages are generated by the selected mes- 
sage handler module and sent to subscribers for which 
this message type or message content has been re- 
quested. A multimedia processing module is preferably 
selected in response to identification of certain multime- 
dia content. 
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